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TITLE OF THE INVENTION 

IMAGE prcx:essing apparatus and METHOD-^ 

BACKGROUND OF THE INVENTION 
Field of the Invention: 

This invention relates to an image processing 
apparatus and method. More particularly, the invention 
relates to an image processing apparatus and method for 
high-efficiency encoding and recognition of images, as 
well as to an image processing apparatus and method for 
detecting the feat\ire points of an image. 
Description of the Related Art: 

Many image encoding techniques have been developed 
on the basis of filtering theory, which is based upon 
analysis of frequency regions, and information-source 
encoding theory. Since the transformation process and 
inverse-transformation process of these theories have 
structures that are symmetrical to each other, the 
inverse transformation is self-evident and it is not 
required that one be aware of the inverse-transformation 
process. For this reason, the conventional encoding is 
referred to as * symmetric encoding' . 

Symmetric encoding will be described in brief. 
First, the performance demanded by symmetric encoding is 
as follows: 



(1) Universality: It is required that various 
types of image information be encodable efficiently. 

(2) Compatibility: It is required that 
compatibility between image media having different 
resolutions be assured. To this end, there is need of a 
scheme through which bit strings for various image media 
having different resolutions are obtained if part of an 
encoded bit string is extracted. 

(3) Fidelity: It is required that a reproduced 
image be close to the original image. This involves 
establishing the concept of distance between images for 
the purpose of evaluating fidelity. 

Rapid progress in symmetric encoding has been made 
in recent years, and sub-band encoding in particular has 
become the focus of much research for the following 
reasons : 

(1) A practical image processing scheme having a 
high degree of coinpatibility and suited to the service 
layer of image communication is capable of being 
implement with ease. 

(2) A visual spatial frequency characteristic 
modeled accurately in the sub-band region can be 
reflected directly in the control of encoding parameters. 

Described next will be the research trends and 



technical level relating to transformation and 
quantization techniques, which are the requisite 
techniques of symmetric image encoding. 

With regard to transformation techniques, studies 
regarding prediction and orthogonal transformation have 
for the most part been completed, and investigations now 
proceeding deal mainly with band separating/ synthesizing 
filters for sub-band encoding. The three capabilities 
required of a band separating/synthesizing filer are as 
follows : 

(1) Perfect reproducibility: In a case where an 
image is reproduced without performing encoding, it is 
required that the original signal be reproduced 
faithfully without causing distortion that accompanies 
folding. 

(2) Linear topological characteristic: It is 
required that the filter have a linear or near linear 
topological characteristic in image encoding. 

(3) Orthogonality: There should be no overlapping 
of filters for band separation. 

At the present time, the only orthogonal filter of 
finite degree which truly satisfies the requirements of 
a linear topological characteristic and perfect 
reproducibility is the Haar filter (which is equivalent 
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to a second-order quadrature mirror filter) , 

With regard to quantization techniques, an encoding 
characteristic that is near ideal has attained an 
implementable technical level . In high-rate 
5 quantization of an unstored information source, 

combining linear quantization and entropy encoding of a 
quantized output makes it possible to achieve 
quantization at a quantization loss of approximately 
0.25 bit/sample in comparison with the rate-distortion 
10 limit. In case of quantization at a lower rate, an 

excellent quantization performance can be realized by 
combining scalar quantization, which is equipped with a 
dead zone, and entropy encoding of a quantized output, 
wherein the entropy encoding includes zero-run encoding* 
15 Furthermore, at a low rate, utilization of vector 
quantization also is possible. If use is made of 
lattice vector quantization or a multi-dimensional 
quantization method in which a code book is furnished 
with a certain algebraic structure, as in the manner of 
20 a permutation code, multi -dimensional, high-efficiency 
quantization closer to the rate-distortion limit can be 
realized over a wide range of from low to high rates. 

However, if an image occupies a major portion of 
the information handled, encoding which relies solely 
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upon geometric information of the information handled is 
insufficient. Accordingly, attempts to carry out 
compression utilizing the structure of an image, namely 
the structure of the actual world, have been studied 
5 from several standpoints. These new atteirpts have 
already shown that there is no assurance that the 
transformation process and inverse- trans formation 
process will be symmetrical. With such encoding, 
therefore, it is required that sufficient consideration 
10 be given to the inverse-transformation process. Such 
encoding is referred to as "^asymmetric encoding". 

The foundation of the concept of a new type of 
image encoding (referred to as ''asymmetric image 
encoding"), sometimes called ''feature extraction 
15 encoding" or "structure extraction encoding" does not 
necessarily depend completely upon filtering theory 
alone, as is the case with conventional image encoding 
techniques. With asymmetric image encoding, symbolizing 
and reproduction algorithms are required to have the 
20 following properties: 

(1) When an input image contains noise, it is 
required that fluctuation produced in a feature 
parameter by such noise be very small. 

(2) A set of input images mapped onto identical 



feature parameters owing to the symbolizing algorithm 
should be as small as possible. If possible, the 
symbolizing algorithm should be one-to-one mapping to 
feature and structure parameters from the input image. 
The feature and structure parameter space cind the 
symbolizing algorithm should be coiriplete. 

(3) The reproduction algorithm should be a stable, 
i.e., continuous mapping. More specifically, when noise 
due to coarse quantization appears in a feature 
parameter extracted by the symbolizing algorithm, and 
even when the calculation accuracy is unsatisfactory, it 
is required that reproduction of an image close to the 
original image be assured. 

(4) It should be possible to calculate both the 
symbolizing algorithm and reproduction algorithm stably 
in nxamerical fashion and in a finite number of steps. 

Asymmetric image encoding presently being studied 
can be classified as follows: 

•Application of IFS (Integrated Function Systems) 
The self -similarity (fractal property) believed to 
exist between partial images of cin input image is 
utilized to find a reduced mapping proper to each 
partial image, and encoding is attempted on the basis 
thereof. At the present time, an affine transformation 



is employed as the mapping. Since a reproduced image is 
given as a set of stable points in a dynamic system, the 
image is decoded by recursive processing, 
•Application of edge information 

It has long been known that edges provide important 
information in terms of visual information processing. 
A complete description of an image can be achieved by 
combining edge information and appropriate additional 
information. In an image encoding/decoding system that 
employs edge information, the position of an edge is 
extracted from an input image by an encoder, a certain 
type of information (e.g., a differential coefficient) 
at this edge position is extracted and the image is 
reproduced by appropriate interpolation performed by a 
decoder . 

•Use of motion information 

A method of estimating a motion vector as a featiare 
parameter and performing encoding using this vector was 
been studied in the field of moving-picture encoding. 
More specifically, a moving vector is predicted, three- 
dimensional motion and a three-dimensional structure 
(irule) are predicted on the basis of the predicted 
vector, and on the basis of these a three-dimensional 
structural (rule) model is constructed. The image is 
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encoded/decoded using this three-dimensional structural 
(rule) model. 

At the present time, it is assimed that when IFS is 
applied in image encoding, the fractal property appears 
5 to the same degree over the entirety of one image. 

However, if a natural image is taken as an example, a 
plurality of objects will be present in one image. 
Therefore, the assumption that there is a fractal 
property over the entirety of the natural image can only 
10 mean that the above method is fairly careless. Further, 
the ert5>loyment of edge information or motion information 
involves many unsolved technical problems in terms of 
pattern recognition, learning of knowledge, etc. 

As mentioned above, an edge is widely accepted as 
15 the feature of an image. Here an edge refers to a 

portion of the image where there is a large amount of 
change in image intensity (luminance, density, 
saturation and hue) . Edge position is detected as the 
position of the maximal value of the first-order 
20 differential coefficient or as the position at which a 
second-order differential coefficient becomes zero. 
(See F, Ulupinar and G. Medioni: "Refining edges 
detected by a LOG operator', Coit^uter vision, graphic 
and image processing, vol. 51, pp. 275 - 298, 1990.) 




A method of detecting a singularity by multi- 
resolution analysis has been proposed, (See S- Mallat 
and W.L. Hwang: "Singularity detection and processing 
with wavelets", IEEE Trans, information theory, vol. 38, 
5 no. 2, pp. 617 - 643, Mar. 1992.) This method performs 
detection based upon the fact that a -singularity 
exhibits a maximal value over all frequency components. 

However, since a first-order differential 
coefficient has the effect of emphasizing noise, a 

10 method of detecting an edge from a first-order 

differential coefficient is susceptible to noise. 
Similarly, a method of detecting an edge from the zero- 
cross point of a second-order differential coefficient 
also is disadvantageous in that the precision of the 

15 position of the detected edge is influenced strongly by 
noise. Further, a method of detecting an edge by multi- 
resolution analysis is highly resistant to the effects 
of noise but involves a large amount of calculation. 
Furthermore, an edge detected by these methods 

20 generally is continuous. If it is attempted to encode 
the original image based upon such an edge, it becomes 
necessary to execute some kind of threshold-value 
processing in order to select a finite niamber of feature 
points from a continuously existing edge. 
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NUMMARY OF THE INVENTION 
An object of the present invention is to provide an 
image processing apparatus and method in which it is 
possible to execute asymmetric encoding that does not 
5 require the extraction of three-dimensional motion and 
three-dimensional structures . 

According to the present invention, the foregoing 
object is attained by providing an image processing 
apparatus, comprising: transformation means for 
10 transforming data space of an input image to multi- 
resolution space and outputting a multi-resolution 
representation of the input image; detecting means for 
detecting a singularity in the input image; extracting 
means for extracting a local pattern, which is formed by 
15 a spatial arrangement of intensities of the multi- 

resolution representation in a partial area containing 
the detected singularity, with regard to partial areas 
of a plurality of sizes; quantizing means for creating a 
quantization code book based upon the extracted local 
20 pattern and replacing the multi-resolution 

representation by a code word using the code book; and 
encoding means for algebraic encoding code data which 
includes position coordinates of the singularity in the 
multi-resolution representation and the code word 



- 11 - 



provided by the quantizing means. 

Another object of the present invention is to 
provide an ixnage processing apparatus and method in 
which image data are retrieved from a partial image by 
applying the aforementioned apparatus and method to 

image recognition- 

According to the present invention, the foregoing 
object is attained by providing an apparatus further 
coT^rising me^ry .eans for storing degree of conformity 
or quantization error, calculated by the quantizing 
means, when the local pattern is allocated to a 
representative vector; and deciding means which, on the 
basis of the degree of conformity or quantization error, 
is for deciding the order relating to the perspective 
depth between any two representative vectors contained.., 
in the code data, wherein the encoding means encodes the 
order relating to the perspective depth. 

A further object of the present invention is to 
provide an image processing apparatus and method 
0 strongly resistant to the effects of noise, in which a 
finite number of feature points can be readily obtained. 

According to the present invention, the foregoing 
object is attained by providing an li-age processing 

apparatus comprising, input means for entering an image, 



- 12 - 



transformation means for transforming data of the input 
image, which is represented by a fxinction on a two- 
dimensional plane entered by the input means, to data of 
a vector field; detecting means for detecting a pole and 
a zero point of image data transformed by the 
transformation means; arithmetic means for calculating a 
finite number of expansion coefficients of a polynomial 
expansion about the pole detected by the detecting 
means; and deciding means which, based upon the position 
of each pole detected by the detecting means, is for 
deciding an area in which a polynomial expansion of 
finite degree obtained by the arithmetic means about 
each pole can be approximated. 

Other features and advantages of the present 
invention will be apparent from the following 
description taken in conjunction with the accompanying 
drawings, in which like reference characters designate 
the same or similar parts throughout the figures thereof. 
BRIEF DE SCRIPTION OF THE DRAWTNGg 
Fig. 1 is a block diagram illustrating the 
construction of an image processing apparatus according 
to an embodiment of the present invention; 

Fig. 2 is a diagram showing an example of a spatial 
sampling interval in. a certain scale; 
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Fig. 3 is a diagram showing an exairple of a local 
pattern of depth A. = 3; 

Fig. 4 is a diagram showing an example of a local 
pattern of depth A,; 

5 Fig . 5 is a diagram showing an example of a set- 

theoretic order relation which holds true between a 
local pattern with depth k and a local pattern with 
depth A, (>k) ; 

Fig. 6 is a diagram showing the manner in which a 
10 perspective order of representative vectors is 

extracted; ^ _ 

Fig. 7 is a block diagram illustrating the 
construction of an image processing apparatus according 
to a second embodiment of the present invention; 
15 Fig. 8 is a diagram showing an example of a 

coordinate transformation by a wide- view lens 
illustrated in Fig. 7; 

Fig. 9 is a diagram showing a path of integration 
performed by a singularity detector shown in Fig. 7; and 
20 Fig. 10 is a diagram showing an example of division 

of an area by a transformation encoder illustrated in 
Fig. 7. 

- DESCRIPTION OF THE PREFERPRn T?^4RnnTMR^F^c; 
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Embodiments of an image processing apparatus 
according to the present invention will now be described 
in detail with reference to the drawings . 
First Embodiment 

Fig, 1 is a block diagram illustrating the 
construction of an image processing apparatus according 
to an embodiment of the present invention. 

As shown in Fig. 1, the apparatus includes an image 
input unit 101, such as an image scanner, for reading 
the image of an original and producing an input image 
151, The image input unit 101 is also capable of 
directly entering linage data from an image processor or 
computer, not shown. The input image 151 is represented 
by Equation (1) shown below. Further, in a case where 
the input image is given as a digital image, it will 
suffice to convert this equation to discrete points. 

f(x\xsR^ (1) 
A transformation unit 102 transforms the data space 
of the input image 151 to multi-resolution space. By 
way of example, multi-resolution space is four- 
dimensional space defined by a one-dimensional scale 
parameter s, a two-dimensional shift parameter b and a 
one-dimensional rotation-angle parameter 0. The input 
image 151 expressed in multi -resolution space is 



- 15 - 



referred to as a multi-resolution representation 152. 
The transformation unit 102 performs a transformation in 
accordance with Equations (2), (3), shown below. In 
Equation (3), R(G) represents a rotation matrix and ^ a 

5 function referred to as an analyzing wavelet, which 

satisfies the condition of Equation (4) below. Further, 
an image f is reproduced in the manner of Equation (5) 
below. 

< f'<l>s.b.e >=/€ix^s,b.eU)f(x) (2) 

<l>s.b.e(x) = s-^(|)(R(e)->(— )) (3) 

s 

= (^^^^/o^yJo'''^^' <l>(rcose,rsme) P< oo (4) 

f = C;7;^Jc|bj;^d9<f,(|>^^^^^ (5) 

10 In actual calculation, a discrete multi-resolution 

space is convenient. In this case, each parameter is 
set in the manner indicated by Equation (6) below. In 
Equation (6), j represents the depth of a local pattern, 
defined later. This differs from the method of dealing 
15 with parameters used ordinarily in that a shift 
parameter in Equation (6) has a width 2Max(0'k-j)^ 
shown in Fig. 2, the spatial sampling interval (the 
offset between an interval indicated at 201 and an 
interval indicated at 202) at a certain scale is equal 
20 to a space width 203 of a wavelet base at a scale finer 
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15 



than the above-mentioned scale by an amount equivalent 
to the depth of the local pattern. 



In order to avoid confusion, real space coordinates 
are adopted as two-dimensional coordinates in real space 
(the image) , and coordinates in multi-resolution space 
are denoted by parameter coordinates. When those having 
rotational symmetry are employed as analyzing wavelets 
<l)x, the multi-resolution space becomes three-dimensional, 
with the axis relating to the angle of rotation having 
omitted. In this embodiment, analyzing wavelets having 
rotational symmetry are employed in order to facilitate 
an understanding of the following discussion. However, 
the discussion would be similar even if the angle of 
rotation were taken into account. In the diagram used 
for the purpose of explanation, real space is expressed 
by one dimension for the sake of simplicity. 
Transformation in case of a analyzing wavelets having 
rotational symmetry is performed as indicated by 
Equation (7) below. 



5 = 2* 



(6) 




(7) 
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A transformation unit 103 transforms the multi- 
resolution representation 152 to a representation of 
local pattern space and outputs a local pattern 153. 
The neighborhood Nx(si,bi) with depth X with respect to a 

parameter coordinate (si = s^^^l^bi) is defined by- 
Equations (8), (9) bfelow. Fig. 3 is a diagram showing 
an example of a local pattern with depth X = 3 . This 

shows that if the parameter coordinates differ, then the 
region covered by the local pattern, namely the real 
space region (the- width b) , and the scale region (which 
corresponds to the frequency region of width s) also 
differ. 

J = 2*. k={k^,k,-l-;Max(P,k,-X)} (8) 
*=*,±((m-i)2*Ax,(«-l)2*A^Xm,«= {l. - .2*-*} (9) 

Thus, the neighborhood Nx(si,bi) with depth \ is 

equal to a quad tree (a binary tree in Fig. 3) including 
the node (si,bi) as the root. The local pattern of 
parameter coordinates (s,b) with depth A, is that 
obtained by making the value of the wavelet coefficient 
«l>sj,bk> at the parameter coordinates correspond to each 
node of Nx(s,b). The local pattern space of depth X is 
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referred to as fxmction space established by defining an 
inner product, which is defined by Equation (10) below 
as the set of the quad tree mentioned above (see Fig, 4) . 
The summation on the right-hand side of Equation (10) 
extends over nodes belonging to the local pattern. The 
local pattern space of depth X will be denoted by T^ and 

elements thereof will be denoted by tn e T^ hereinafter. 

On the basis of information representing the 
parameter coordinates and depth sent from a quantizer 
105, described later, the transformation unit 103 
extracts the corresponding local patterns in discrete 
multi-resolution space and maps them in local pattern 
space of the respectively corresponding depths. 
Furthermore, the transformation unit 103 coxints the 

frequency p(t^) of appearance of each of the local 
patterns t^ and stores this value. 

A singularity detector 1031 detects singularities 
of an image that has been entered by the image input 
unit 101. An edge detected by a Laplacian filter or 
Canny filter or a featxire point of the image may serve 
as a singularity. 




On the basis of the frequency p(tn) of appearance 
of the local patterns mentioned above, the code book 

creating unit 104 registers a local pattern tJi, at which 
the appearance frequency p has exceeded a suitable value 
(threshold value) , in a code book as a representative 
vector, and establishes a relationship between the 
representative vector and a codeword. A jth 

representative vector with depth X will be denoted by d^ 

hereinafter. It should be noted that there can be a 
case in which the threshold value with respect to the 

occurrence frequency p is set for every depth X and a 

case in which the threshold value is set to be the same 

for all depths X. 

A set of elements for which a set-theoretic order 
relation hold true exists between a local pattern with 

depth k and a local pattern with depth X (>k) (see Fig. 

5. An inclusion-relation extractor 1041 connected to 
the code book creation xmit 104 investigates the 
relation between local patterns having different depths. 
The order relationship referred to here is indicated by 
Equation (11) below. When trees coincide, this means 
that the values of wavelet coefficients at each node 
coincide. Though local patterns having different depths 
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are correlated by such an order relation, the actual 
data contains an error. Therefore, representation as 
indicated by Equation (12) below using the inner product 
defined earlier is more realistic. 

Among the partial trees >dj^ , if and only if 

there is at least one sxibtree of 
which coincides with (11) 

df > d^ , if and only if there is at least one 
subtree of d?' for which the inner product with 
is greater than a certain value (12) 
A quantizer 105 compares the multi -resolution 
representation 152 with a representative vector in a 
code book 154. The quantizer 105 performs the 
comparison with regard to local patterns of all depths 
for which representative vectors exist at each of the 
parameter coordinates in multi-resolution space, and the 
local patterns in the neighborhood of each of the 
parameter coordinates are replaced by code words 

corresponding to the representative vector d^ for which 
the above-mentioned inner product is maximized, thereby 
achieving encoding. In a case where the maximal value 
of the inner product does not become greater than a 
certain threshold value, the local pattern which is the 
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object of comparison is adopted as the candidate for a 
representative vector and the parameter coordinates and 
depth thereof are output ted to the transformation unit 
103. As a result of this processing, correspondence 
between appropriate code words and all parameter 
coordinates of multi-resolution space is obtained. 

A redundancy eliminating unit 106 compresses an 
image 155 that has been encoded by the above-described 
local pattern. The data encoded using the 
representative vectors at each of the parameter 
coordinates are redundant in the sense of the order 
relation between representative vectors close to each 
other in real space (see Fig. 5) . Accordingly, among 
the representative vectors supportive of which in real 
space share a common region, the redundancy eliminating 
unit 106 deletes unnecessary representative vectors 
using the set-theoretical order relation extracted by 
the inclusion-relation extractor 1041. 

A perspective-order coirputation unit 107 extracts 
the perspective-order relationship with respect to real 
depth in the real space of real objects represented by 
representative vectors from a signal 156 output ted by 
the redxindancy eliminating unit 106. There exists some 
pairs of representative vectors close to each other in 



the real space supportive of which in real space share a 
common region. Accordingly, these perspective-order 
relationships can be obtained depending upon which of 
two representative vectors in nearer than the other to 
the wavelet coefficients in the common region. This 
will be described with reference to Fig. 6. 

In order to encode local patterns neighboring each 
of the parameter coordinates, the quantizer 105 
calculates the inner product of the representative 
vectors. If a representative vector regarding an object 
situated at the most forward position in real space has 
been acquired and this object is not hidden by anything, 
the inner product between this representative vector and 
a local pattern corresponding to this object will 
coincide with the norm of the representative vector in a 
state in which there is no measured error. Local 
patterns 601 and 602 in Fig. 6 are local patterns 
corresponding to such an object (situated at the most 
forward position) . 

A local pattern 603 shown in Fig. 6 is a local 
pattern corresponding to an object hidden by another 
object. A common region shared by the support of the 
local pattern 603 and that of the other object (i.e., 
the local pattern 602) in real space is bio and bn . In 
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a region other than this shared region (namely a region 
extending from bll to b 13), the inner product between 
the representative vectors corresponding thereto should 
coincide with the norm. In other words, in the shared 
regions blO and bll, it can be predicted that the object 
603 will be hidden by the object 602. Thus, by 
ascertaining the inner product between representative 
vectors at each of the parameter coordinates, the 
perspective-order relationship of the respective local 
pattern can be determined. 

An algebraic encoder 108 algebraically encodes an 
output signal 157 from the perspective-order computation 
unit 107, namely a code word, real space coordinates, 
rotation and scale of a singularity and perspective- 
order information, and outputs a code 158. The 
outputted code 158 is stored in a memory (not shown) or 
transmitted via a line. 

An algebraic decoder 109 algebraically decodes the 
code 158, which has entered from a memory or line. An 
inverse quantizer 110 generates local patterns from the 
code word contained in the algebraically decoded signal 
159 and outputs the local patterns as a signal 160 
indicative of the multi-resolution representation. 

On the basis, of the algebraically decoded real 
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space coordinates, rotation, scale and perspective -order 
information, an image synthesizer 111 combines at least 
two local patterns, which have been generated by the 
inverse quantizer 110, into a single image. The signal 
160 is expressed in multi-resolution space. An inverse 
transformation unit 112 inversely transforms the image 
(the multi -resolution representation) , which has been 
synthesized by the image synthesizer 111, to a function 
in real space and outputs a signal 162 indicative of 
this function. 

An image output unit 113 is a printer or display 
and forms or displays the image represented by the 
signal 162 . The image output unit 113 stores the signal 
162 on a recording medium or outputs the signal 162 to 
an image processor or computer, not shown. 

An example has been described in which the 
transformation iinit 102 transforms the input image 151 
to multi-resolution space. However, it is also 
effective to transform the input image 151 to a multi- 
resolution pyramid. This method involves dividing the 
original image into several (e.g., four) partial images, 
obtaining the mean of the pixel values of each of the 
partial images, subdividing the partial images into 
several partial images and obtaining the mean of each 
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image. By repeating this processing, a multi-resolution 
space approximation of the original image can be 
obtained . 

Thus, in accordance with this embodiment, each 
input image can be expressed by a plurality of code 
words as well as the positions and sizes thereof or by 
the perspective-order relationship of the representative 
vectors. Further, though innumerable images can be 
generated by movement of a finite number of rigid bodies, 
this embodiment makes it possible to compactly express 
an image set of this type. 

More specifically, this embodiment has the 
following effects: 

(1) The perspective-order computation unit 
extracts information indicative of a plurality of code 
words, positions corresponding thereto and/or angle of 
rotation and/ or scale, as well as information 
representing the perspective-order relationship of any 
two of these representative vectors, contained in the 
input image outputted by the quantizer. On the basis of 
the extracted information, the encoder encodes the input 
image. This makes it possible to achieve highly 
efficient encoding. 

(2) The perspective-order computation unit 
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„3res tne local pa«ems o. one i»,e *ose of 

con5>ares cue above, 
.no^er one using the in£on»ation extracted .n (1) 

f ^v,. local patterns of any 
^ a result, the con^«rison of the local 

^ i^^es can be perforK^ with ease. 

,3, If (2) ahove is applied to an i.-ge 

«r_rieval of linage data from 
recognition apparatus, retrieval 

_,ial images can be performed with ease. 

K- ma the encoding apparatus described 
(4) By combining tne eiiv- 

i„ a> above and a decoding apparatus having the 

. 109 the inverse quantizer 110 «hrch 
.0 algebraic decoder 109, the ^,as 

generates local patterns from a plurality 
siting from decoding and the ^ ^^^^ 

..r synthesizing local patterns by positions, rotation 

t-Yie Generated local 
3„gles and sizes corresponding to the ge 

, the information representing the . 
15 patterns and the 1 ^„ two representative 

perspective-order relationship of any tw 

ctirs the amount of data transmitted, if i-.e data 

smitted is greatly reduced and the „u^ 
are to be transmitted, is g . a 

^ r^aoable of being stored in a 
of items of data (images) capable 

20 memory device can be increased. 

Second Embodiment 

Pig. 7 is a bloc^ diagram illustrating the 
..nstruction of » .nage processing apparatus according 

x: present invention, 
to a second embodiment of the presen 



As shovTn in Fig. 7, the apparatus includes cin image 
input unit 1108 having a wide-view lens 1101, an array 
sensor 1102 and an input -parameter controller 1106, the 
details of which will be described later. Light that 
has passed through the wide-angle lens 1101 impinges 
upon the array sensor 1102, which converts the light to 
an iifiage signal and outputs the image signal. 

A transformation unit 1103 applies processing, 
described later, to the image signal that has entered 
from the image input unit 1108, A singularity detector 
1104 applies processing, described later, to a signal 
that has entered from the transformation unit 1103 . A 
transformation encoder 1105 applies processing, 
described later, to a signal that has entered from the 
singularity detector 1104 and outputs a code to an 
information processor 1107 . 
[Wide-view lens 1101] 

Fig. 8 is a diagram showing an example of a 
coordinate transformation performed by the wide-view 
lens 1101. In Fig. 8, x represents the radius of a 
polar coordinate system on an image plane situated on 
the front side of the input system, an t represents the 
radius of the polar coordinate system after a 
transformation performed by the wide-view lens 1101. 
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Since this optical system is rotationally syrnmetric, the 
description given below will rendered solely with regard 
to the radial direction. 

If we let ^ represent the angle of incidence with 

respect to the optic axis and ^ the focal length of a 
lens having the radius of the circle shown in Fig. 8, 
the radii t and x will be represented by the following 
equations, respectively: 

/ = 2^tan(^/2) (13) 
x = ^tan(0) (14) 

On the basis of Equations (13), (14), the radius x 
can be expressed by the equation below as a function of 
the radius t. This is the coordinate transformation in 
the wide-view lens 1101. 



X = 



(15) 



'ihe pattern (function) of frequency f(0), in the 
radial direction, which spreads oyer the entirety of the 
image plane is changed to the following function by the 
wide-view lens 1101: 



Of covirse, in a case where an arbitrary object 
present in three-dimensional space is adopted as the 
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subject, it will suffice to know only the relationship 
between the incidence angle <() and the radius t. 

Accordingly, the coordinate transformation becomes as 
follows : 

r = 2^tan(0/2) (17) 
The coordinate trans foinnat ion performed by the 
wide-view lens 1101 described above is a conformal map* 
Accordingly, singularities not present in the original 
image do not arise. 
[Array sensor 1102] 

If we let <|)j (x) represent the integral kernel, the 
output gj of a jth (coxmting from the center of the 
optic axis) eurray sensor 1102 will be represented by the 
following: 

gj =«l>jixit;mfi<t.^))> 

= f^dt^Mt.^y)nx(t;^y) (18) 

[Transformation lanit 1103] 

The transformation unit 1103 causes a crradient V to 

act upon an image f(x,y) which is a function in two- 
dimensional space, thereby generating a vector field. 

g(x.y) = Vfix.y) 

. df . df 
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where el,e2 are components of basis. 

Fiirther, g(x,y) can be expressed as follows: 

g(z)=r(z)exp(0(z)) (20) 
z=x + /y (21) 

r(z)=| I V/(x,3;)||= ^(df I dxf^idf I dyf (22) 
0(2)=arg(V/(x,>;)) = arctan(^^) (23) 

Complex representation is used in the description 
given below. 

[Singularity detector 1104] 

The singularity detector 1104 detects singularities 
by utilizing the argument principle. The argument 
principle will now be described. Let a function f (z) be 
a meromorphic in a simply-connected region D, and let C 
represent a simple closed curve having a positive 
direction within D. Assxame that there is neither a zero 
point nor pole present on the closed curve C. In the 
interior of the closed curve C, assume that the function 
f (z) has a zero point of order at each of a^ (k = 1, 

2, , m) and a pole of order at each of bk (k = 1, 

2, n) . When starting from a point Zq on the closed 

curve C and returning to the point Zq upon making one 
full revolution of the closed curve, the amount of 
increase Ac arg[f(z)] of the argument of the function 
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f (z) is given by the following equation if we let Nz, Np 
represent the niunbers of zero points and poles, 
respectively, within the closed cxirve C: 

Acarg(/(z)) = 2ju{N,if)- N^(f)} (24) 

A method of utilizing the argument principle to 
detect a feature point of a vector field {gia,n>m,n 
quantized in space will now be described. 

Step 1: Any point z^^p = Tfk,p-exp(e)c,p) in a defined 

region of an image is selected. 

Step 2: A circle C centered on Zk,p and having a 

suitable radius p is considered. 

If this operation is performed, the following 
equation holds true with respect to a number Nz(g) = Xxk 
of zero points and a number J^(z) = Xjlk of poles, which 
reside within the circle C, of the complex fxinction g(z) 
in consideration of their orders obtained by the vector- 
field constructing unit (transformation unit 1103) in 
accordance with the argument principle. 

N,ig)-N,(g)^^f^^ (25) 

Accordingly, by considering a circle having a 
radius of one pixel and centered on each pixel of a 
given image, the difference between the numbers of zero 
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points and poles at all pixels is obtained using the 
above-described argxunent principle. At this time, the 
zero points or poles included in each circle are 
considered to be one at most. Therefore, a zero point 
is included when the right side of Equation (25) is 
positive and a pole is included when the right side of 
Equation (25) is negative. Accordingly, a center pixel 
when a pole is determined to be present is detected as 
the position of a singularity. 

Here Ac arg[g(z)] is calculated in the manner 
described below (see Fig. 9, which illustrates an 
integration path) . The integration path may be a closed 
curve and is not limited the size qhown in Fig. 9. 
Acarg(g(z))|^,^^ 

+y(g(^»-i^i).g(2 ))+y(g(z«_v.).«(^i»-i^)) 

mg(Za»i^iXg(.z„^y)+Y(giZn^A^x g(z^i^y) (26) 

y(x,>') = j"'"'"^^ ^ argx-arg^^^r 
[ arg y-aigx otherwise 

[Transformation encoder 1105] 

In order to divide the defined region of a vector 
field into a plurality of areas based upon the positions 
of poles, it will suffice to divide the defined region 
in such a manner that only one pole, for example, is 
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m 



included (see Fig, 10, which shows an example of 
dividing a region) . If this expedient is adopted, 
convergence of the Laurent series aroiind the pole is 
guaranteed in each region except the pole. 
5 In order to fianctionally approximate the 

partitioned vector field around each pole in the range 
of the region corresponding thereto, if the positions of 
the poles and zero points within the partial area Dj as 
well as the order are already known, the function w(z) 

10 can be expressed by Equations (28) through (30) below 

based upon the pole positions and orders aj,mj (j = 1, 2, 
— , J) and the zero-point positions and orders bk,nk (k 
= 1, 2, • • * , K) . In a case where the zero point 
position and order cannot be specified, an orthogonal 

15 polynomial expansion can be en^loyed. By using the 

Laurent series expansion around each pole to approximate 
each region, a reduction in error can be expected in a 
case where a large area is expressed by one function • 

1 X >Kt7) 



20 [Input-parameter controller 1106] 
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The input -parameter controller 1106 controls the 
rotational angle of the optic axis of wide-view lens 
1101 based upon control information of the rotational 
angle of the optic axis that has entered from the 
information processor 1107. The information processor 
1106 calculates the amount of control of the rotational 
angle of the optic axis in such a manner that a 
singularity is taken on the optic axis. 

Thus, by regarding visual information, which serves 
as a fiinction on a two-dimensional plane, as one 
variable complex fxinction, it is assured that the poles 
and zero points of the complex function will be isolated 
so long as the function is not a constant. In a case 
where the function is a constant, it will not contain 
any information and may therefore be ignored. 
Accordingly, in accordance with this embodiment, a 
finite nxjmber of singularities can be obtained naturally 
and with facility. 

The apparatus is strongly resistant to noise. For 
example, impulse noise is detected as an edge in the 
method that relies upon both first-order and second- 
order differentiation. However, in the method of 
generating a vector field and detecting the singularity 
thereof, impulse noise is detected as a zero point. 




Accordingly, if a pole is employed as a singularity, as 
mentioned above, impulse noise has no effect whatsoever 
upon singularity detection. Adopting a pole and not a 
zero point as a singularity is appropriate from the 
viewpoint of function approximation. The reason for 
this is that the deciding factor of converging radius in 
case of series expansion is the position of a pole; a 
zero point plays no part whatsoever. 

Thus, this embodiment can be suitably applied to 
image processing such as extraction of feature points 
and encoding/decoding of images. 

Other Embodiment 

The present invention can be applied to a system 
constituted by a plurality of devices (e.g., host 
computer, interface, reader, printer) or to an apparatus 
comprising a single device (e.g., copy machine, 
facsimile) . 

Fvirther, the object of the present invention can be 
also achieved by providing a storage mediiam storing 
program codes for performing the aforesaid processes to 
a system or an apparatus, reading the program codes with 
a computer (e.g., CPU, MPU) of the system or apparatus 
from the storage medium, then executing the program. 



In this case, the program codes read from the 
storage medivmi realize the fxinctions according to the 
embodiments, and the storage mediiam storing the program 
codes constitutes the invention. 

Further, the storage medium, such as a floppy disk, 
a hard disk, an optical disk, a magneto-optical disk, 
CD-ROM, CD-R, a magnetic tape, a non-volatile type 
memory card, and ROM can be used for providing the 
program codes. 

Furthermore, besides aforesaid functions according 
to the above embodiments are realized by executing the 
program codes which are read by a computer, the present 
invention includes a case where an OS (operating system) 
or the like working on the computer performs a part or 
entire processes in accordance with designations of the 
program codes and realizes functions according to the 
above embodiments. 

Fxirthermore, the present invention also includes a 
case where, after the program codes read from the 
storage medium are written in a function expansion card 
which is inserted into the computer or in a memory 
provided in a function e^^ansion unit which is connected 
to the computer, CPU or the like contained in the 
function expansion card or unit performs a part or 
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entire process in accordance with designations of the 
program codes and realizes functions of the above 
embodiments . 

As many apparently widely different embodiments of 
the present invention can be made without departing from 
the spirit and scope thereof, it is to be understood 
that the invention is not liiciited to the specific 
embodiments thereof except as defined in the appended 
claims* 



